FIG. 2
ST3Gal III activities of the amylose-purified
refolded MBP-ST3Gal III fractions




FIG. 3 
FIG. 4
EPO



GlycoPEGylation (20 K) of EPO 
Full length UDP-N-acetylgalactosaminyltransferase 2 (GalNAcT2)

nucleic acid and amino acid sequences

Amino acid sequence

Met Arg Arg Arg Ser Arg Met Leu Leu Cys Phe Ala Phe Leu Trp Val
1 5 10 15

Leu Gly Ile Ala Tyr Tyr Met Tyr Ser Gly Gly Gly Ser Ala Leu Ala
20 25 30

Gly Gly Ala Gly Gly Gly Ala Gly Arg Lys Glu Asp Trp Asn Glu Ile
35 40 45

Asp Pro Ile Lys Lys Lys Asp Leu His His Ser Asn Gly Glu Glu Lys
50 55 60

Ala Gln Ser Met Glu Thr Leu Pro Pro Gly Lys Val Arg Trp Pro Asp
65 70 75 80

Phe Asn Gln Glu Ala Tyr Val Gly Gly Thr Met Val Arg Ser Gly Gln
85 90 95

Asp Pro Tyr Ala Arg Asn Lys Phe Asn Gln Val Glu Ser Asp Lys Leu
100 105 110

Arg Met Asp Arg Ala Ile Pro Asp Thr Arg His Asp Gln Cys Gln Arg
115 120 125

Lys Gln Trp Arg Val Asp Leu Pro Ala Thr Ser Val Val Ile Thr Phe
130 135 140

His Asn Glu Ala Arg Ser Ala Leu Leu Arg Thr Val Val Ser Val Leu
145 150 155 160

Lys Lys Ser Pro Pro His Leu Ile Lys Glu Ile Ile Leu Val Asp Asp
165 170 175

Tyr Ser Asn Asp Pro Glu Asp Gly Ala Leu Leu Gly Lys Ile Glu Lys
180 185 190

Val Arg Val Leu Arg Asn Asp Arg Arg Glu Gly Leu Met Arg Ser Arg
195 200 205

Val Arg Gly Ala Asp Ala Ala Gln Ala Lys Val Leu Thr Phe Leu Asp
210 215 220

Ser His Cys Glu Cys Asn Glu His Trp Leu Glu Pro Leu Leu Glu Arg
225 230 235 240

Val Ala Glu Asp Arg Thr Arg Val Val Ser Pro Ile Ile Asp Val Ile
245 250 255

Asn Met Asp Asn Phe Gln Tyr Val Gly Ala Ser Ala Asp Leu Lys Gly
260 265 270

Gly Phe Asp Trp Asn Leu Val Phe Lys Trp Asp Tyr Met Thr Pro Glu
275 280 285

FIG. 13A 
Gln Arg Arg Ser Arg Gln Gly Asn Pro Val Ala Pro Ile Lys Thr Pro
290 295 300

Met Ile Ala Gly Gly Leu Phe Val Met Asp Lys Phe Tyr Phe Glu Glu
305 310 315 320

Leu Gly Lys Tyr Asp Met Met Met Asp Val Trp Gly Gly Glu Asn Leu
325 330 335

Glu Ile Ser Phe Arg Val Trp Gln Cys Gly Gly Ser Leu Glu Ile Ile
340 345 350

Pro Cys Ser Arg Val Gly His Val Phe Arg Lys Gln His Pro Tyr Thr
355 360 365

Phe Pro Gly Gly Ser Gly Thr Val Phe Ala Arg Asn Thr Arg Arg Ala
370 375 380

Ala Glu Val Trp Met Asp Glu Tyr Lys Asn Phe Tyr Tyr Ala Ala Val
385 390 395 400

Pro Ser Ala Arg Asn Val Pro Tyr Gly Asn Ile Gln Ser Arg Leu Glu
405 410 415

Leu Arg Lys Lys Leu Ser Cys Lys Pro Phe Lys Trp Tyr Leu Glu Asn
420 425 430

Val Tyr Pro Glu Leu Arg Val Pro Asp His Gln Asp Ile Ala Phe Gly
435 440 445

Ala Leu Gln Gln Gly Thr Asn Cys Leu Asp Thr Leu Gly His Phe Ala
450 455 460

Asp Gly Val Val Gly Val Tyr Glu Cys His Asn Ala Gly Gly Asn Gln
465 470 475 480

Glu Trp Ala Leu Thr Lys Glu Lys Ser Val Lys His Met Asp Leu Cys
485 490 495

Leu Thr Val Val Asp Arg Ala Pro Gly Ser Leu Ile Lys Leu Gln Gly
500 505 510

Cys Arg Glu Asn Asp Ser Arg Gln Lys Trp Glu Gln Ile Glu Gly Asn
515 520 525

Ser Lys Leu Arg His Val Gly Ser Asn Leu Cys Leu Asp Ser Arg Thr
530 535 540

Ala Lys Ser Gly Gly Leu Ser Val Glu Val Cys Gly Pro Ala Leu Ser
545 550 555 560

Gln Gln Trp Lys Phe Thr Leu Asn Leu Gln Gln
565 570

FIG. 13A (CONT.)
Nucleic acid sequence

atgcggoggc gctcgoggat gctgctctgc ttogocttoc tgcgggLycL gggcatogoc 60 

tactacatgt actogggggg cggctcbgtg ctggaogggg gogogggogg oggogcoggc 120 

aggaaggagg actggaatga aattgaoaoc attaaaaaga aagaocttca tcacagcaat 180 

ggagaagaga aagcaeaaag catggagaoc ctaxtocag ggaaagtaog gtggocagac 240 

tttaaocagg aagctbabgt tggagggaog atggtoogct oogggcagga oact±aogoc 300 

cgcaacaagt tcaaocaggt ggagagtgat aagcttogaa tggacagagc catoactgac 360 

aoooggcatg aocagtgtca goggaagcag tggcgggtgg atctgocggc caacagcgtg 420 

gtgatcaogt ttaacaatga agooaggtog gocctactca ggaocgtggt cagogtgctt 480 

aagaaaagoc ogooocatct cataaaagaa atcatcttgg tggatgacta cagcaatgat 

ocbgaggaog gggctctctt ggggaaaatt gagaaagtgc gagttcttag aaatgatoga 

cgagaaggoc teatgcgctc aagggttcgg ggggoogatg ctgcxxaagc caaggtoctg 660 

aocttocbgg acagtoactg ogagbgtaat gagcacbggc tggagaaoct actggaaagg 

gtggcggagg acaggactog ggttgtgtca oocatcatag atgtcattaa tatggacaac 

tttcagtatg tgggggcatc tgctgacttg aagggcggtt ttgatbggaa ct±ggtattc 840 

aagtgggatt acatgacgoc tgagcagaga aggtoocggc aggggaaaac agtogoaact 900 

ataaaaaooc ccstgattgc tggtgggctg t±±gtgatgg ataagttcta tttfcgaagaa 960 

cbggggaagt aogacatgat gatggabgtg tggggaggag agaaoctaga gatctcgttc 1020 

cgcgtgtggc agtgtggtgg cagasbggag ateatoocgt gcagaogtgt gggacaogtg 

ttocggaagc agcaaoacta caogtbaaog ggtggcagtg gcactgtctt tgaacgaaac 

aoocgocggg cagcagaggt cbggatggat gaatacaaaa atttctatta tgcagcagtg 

octtctgcta gaaacgt±oc ttatggaaat attcagagca gattggagct taggaagaaa 

cteagctgca agactttcaa atggtacctt gaaaatgtct atocagagtt aagggttcca 1320 

gacratcagg abatagcttt tggggocttg cagcagggaa ctaactgact ogacactt±g 1380 

ggacactttg ctgatggtgt ggttggagtt tatgaatgtc acaatgctgg gggaaaooag 1440 

gaatgggcct tgacgaagga gaagtoggtg aagcacatgg atttgtgoct tactgtggtg 1500 

gacrgggcac ogggctctct tataaagcfcg cagggctgoc gagaaaatga cagcagacag 1560 

aaatgggaac agatogaggg caactocsag ctgaggcaog tgggcagcaa octgtgoctg 1620 

gacagtogca oggocaagag ogggggocta agcgtggagg tgtgtggooc ggaoctttog 1680 

csgcagtgga agttaaagct caaoctgcag cag 1713 

FIG. 13B 
Δ51 UDP-N-acetylgalactosaminyltransferase 2, GalNAcT2,

nucleic acid and amino acid sequences

Amino acid sequence

Lys Lys Lys Asp Leu His His Ser Asn Gly Glu Glu Lys Ala Gln Ser
1 5 10 15

Met Glu Thr Leu Pro Pro Gly Lys Val Arg Trp Pro Asp Phe Asn Gln
20 25 30

Glu Ala Tyr Val Gly Gly Thr Met Val Arg Ser Gly Gln Asp Pro Tyr
35 40 45

Ala Arg Asn Lys Phe Asn Gln Val Glu Ser Asp Lys Leu Arg Met Asp
50 55 60

Arg Ala Ile Pro Asp Thr Arg His Asp Gln Cys Gln Arg Lys Gln Trp
65 70 75 80

Arg Val Asp Leu Pro Ala Thr Ser Val Val Ile Thr Phe His Asn Glu
85 90 95

Ala Arg Ser Ala Leu Leu Arg Thr Val Val Ser Val Leu Lys Lys Ser
100 105 110

Pro Pro His Leu Ile Lys Glu Ile Ile Leu Val Asp Asp Tyr Ser Asn
115 120 125

Asp Pro Glu Asp Gly Ala Leu Leu Gly Lys Ile Glu Lys Val Arg Val
130 135 140

Leu Arg Asn Asp Arg Arg Glu Gly Leu Met Arg Ser Arg Val Arg Gly
145 150 155 160

Ala Asp Ala Ala Gln Ala Lys Val Leu Thr Phe Leu Asp Ser His Cys
165 170 175

Glu Cys Asn Glu His Trp Leu Glu Pro Leu Leu Glu Arg Val Ala Glu
180 185 190

Asp Arg Thr Arg Val Val Ser Pro Ile Ile Asp Val Ile Asn Met Asp
195 200 205

Asn Phe Gln Tyr Val Gly Ala Ser Ala Asp Leu Lys Gly Gly Phe Asp
210 215 220

Trp Asn Leu Val Phe Lys Trp Asp Tyr Met Thr Pro Glu Gln Arg Arg
225 230 235 240

Ser Arg Gln Gly Asn Pro Val Ala Pro Ile Lys Thr Pro Met Ile Ala
245 250 255

Gly Gly Leu Phe Val Met Asp Lys Phe Tyr Phe Glu Glu Leu Gly Lys
260 265 270

Tyr Asp Met Met Met Asp Val Trp Gly Gly Glu Asn Leu Glu Ile Ser
275 280 285

Phe Arg Val Trp Gln Cys Gly Gly Ser Leu Glu Ile Ile Pro Cys Ser

FIG. 14A 
290 295 300

Arg Val Gly His Val Phe Arg Lys Gln His Pro Tyr Thr Phe Pro Gly
305 310 315 320

Gly Ser Gly Thr Val Phe Ala Arg Asn Thr Arg Arg Ala Ala Glu Val
325 330 335

Trp Met Asp Glu Tyr Lys Asn Phe Tyr Tyr Ala Ala Val Pro Ser Ala
340 345 350

Arg Asn Val Pro Tyr Gly Asn Ile Gln Ser Arg Leu Glu Leu Arg Lys
355 360 365

Lys Leu Ser Cys Lys Pro Phe Lys Trp Tyr Leu Glu Asn Val Tyr Pro
370 375 380

Glu Leu Arg Val Pro Asp His Gln Asp Ile Ala Phe Gly Ala Leu Gln
385 390 395 400

Gln Gly Thr Asn Cys Leu Asp Thr Leu Gly His Phe Ala Asp Gly Val
405 410 415

Val Gly Val Tyr Glu Cys His Asn Ala Gly Gly Asn Gln Glu Trp Ala
420 425 430

Leu Thr Lys Glu Lys Ser Val Lys His Met Asp Leu Cys Leu Thr Val
435 440 445

Val Asp Arg Ala Pro Gly Ser Leu Ile Lys Leu Gln Gly Cys Arg Glu
450 455 460

Asn Asp Ser Arg Gln Lys Trp Glu Gln Ile Glu Gly Asn Ser Lys Leu
465 470 475 480

Arg His Val Gly Ser Asn Leu Cys Leu Asp Ser Arg Thr Ala Lys Ser
485 490 495

Gly Gly Leu Ser Val Glu Val Cys Gly Pro Ala Leu Ser Gln Gln Trp
500 505 510

Lys Phe Thr Leu Asn Leu Gln Gln
515 520



FIG. 14A (CONT.) 
Nucleic acid sequence

aaaaagaaag aacttcatca cagcaatgga gaagagaaag cacaaagcat ggagaaactc 60 

octocaggga aagtacggtg gccagacttt aaacaggaag cttatgttgg agggaogatg 120 

gtocgctocg ggcaggaocc ttacgoocgc aacaagtixa aocsggbgga gagbgataag 180 

cttcgaatgg acagagocat ocx±gacaac cggcatgaac agtgtcagag gaagcagtgg 240 

ogggtggatc tgocggocac cagcgtggtg atcacgttto acaatgaagc caggtoggcc 300 

ctactcagga cxgtggtxag ogbgcttaag aaaagccogc oocatctcat aaaagaaatc 360 

atcttggtgg atgactacag caatgatoct gaggacgggg ctctcttggg gaaaattgag 420 

aaagtgogag ttcttagaaa tgatagaaga gaaggoctca tgogctxaog ggttaggggg 480 

gocgatgctg oocaagocaa ggtoctgaoc ttocbggaca gtcactgoga gtgtaatgag 540 

cactggctgg agoooctact ggaaagggtg gcggaggaoa ggactogggt tgtgtcaooc 600 

atcatogatg tcattaatat ggacaacttt cagtatgtgg gggaatcbgc tgacttgaag 660 

ggcggttttg attggaactt ggtattcaag tgggat±aca tgaogcctga gcagagaagg 720 

tcxxggcagg ggaaoocagt ogoooctata aaaaooooca tgattgctgg tgggctgttt 780 

gtgatggata agttctattt togaagaactg gggaagtaog acatgatgat ggatgbgfcgg 840 

ggaggagaga aoctagagat ctogttoagc gtgtggcagt gtggtggcag octggagatc 900 

ataxgtgca gcogbgtggg acaogtgttc cggaagcagc acxxctacac gttorgggt 960 

ggcagtggca ctgtctttgc oogaaacaac ogoogggcag cagaggtctg gatggatgaa 1020 

tacaaaaatt tctattatgc agcagtgoct tctgctagaa aogttoctta tggaaatatt 1080 

cagagcagat tggagcttag gaagaaactc agcbgcaagc ctttcaaatg gtaccttgaa 1140 

aatgbctatc cagagttaag ggttocagac catcaggata tagcttttgg ggacttgcag 1200 

cagggaacta actgactoga cactttggga cactttgctg atggtgtggt tggagtttat 1260 

gaatgtcaca atgctggggg aaaocaggaa tgggacttga cgaaggagaa gtcggtgaag 1320 

cacatggatt tgbgocttac fagtggtggac ogggcaaogg gctctcttat aaagctgcag 1380 

ggctgocgag aaaatgacag cagacagaaa tgggaacaga togagggcaa ctacaagctg 1440 

aggcaogbgg ycageaaocb' gtgacbggac agtogcaogg acaagagogg gggoctaagc 1500 

gtggaggtgt gtggaooggc octttogcsg cagtggaagt tcaogctcaa acbgcagcag 1560 

FIG. 14B 
Test pST3Gal1 for activity after Hampton refold



FIG. 25 
Lanes: 1) SeeBlue Plus2

2) MBP-GalNAcT2

3) MBP-GalNAcT2 (conc)

4) Co-refold

5) Co-refold (conc)

6) MBP-ST3Gal1

7) MBP-ST3Gal1 (conc)

MBP-GalNAcT2 
MBP-ST3Gal1 



FIG. 27A 
Lanes:

1) SeeBlue Plus2

2) Control enzymes 0 hrs

3) Control enzymes 16 hrs

4) Separate refold 0 hrs

5) Separate refold 16 hrs

6) Co-refold 0 hrs

7) Co-refold 16 hrs



MBP-GalNAcT2
MBP-ST3Gal1
IFα-2b-20K PEG



IFα-2b



FIG. 27B
Human ST6GalNAcI

MRSCLWRCRHLSQGVQWSLLLAVLWFLFALPSFIKEPQTKPSRHQRTENIKERSLQS 

LAKPKSQAPTRARRTTIYAEPVPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDK 

WHTAQRAAWKSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNG 

GOTRKLTASRTVSEKHQGKAATTAKTLIPKSQHRMLAPTGAVSTRTRQKGVTTAVIP 

PKEKKPQATPPPAPFQSPTTQRNQRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSV 

KIKASKSLWLQKLFLPNLTLFLDSRHFNQSEWDRLEHFAPPFGFMELNYSLVQKVVT 

RFPPVPQQQLLLASLPAGSLRCITCAVVGNGGILNNSHMGQEIDSHDYVFRLSGALIK 

GYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFBGWPLGBaDVRYLHFLEGTRDYEWLE 

ALLMNQWMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFLRYMKNRFLRSKTLD 

GAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSWKRLIFYINH 

DFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN 

FIG. 38A

Chicken ST6GalNAcI 

MGFLIRRLPKDSRIFRWLLILTVFSFnTSFSALFGMEKSIFRQLKIYQSIAHMLQVDTQ 

DOQGSNYSANGRISKVGLERDIAWLELNTAVSTPSGEGKEEQKKTVKPVAKVEEAK 

EKVTVKPFPEVMGITNTTASTASVVERTKEKTTARPVPGVGEADGKRTTIALPSMKE 

DKEKATVKPSFGMKVAHANSTSKDKPKAEEPPASVKAIRPVTQAATVTEKKKLRAA 

DFKTEPQWT)FDDEYILDSSSPVSTCSESVRAKAAKSDWLRDLFLPNITLFIDKSYFNV 

SEWDRLEHFAPPYGFMELNYSLVEEVMSRLPPNPHQQLLLANSSSNVSTCISCAWG 

NGGILNNSGMGQEIDSHDYVFRVSGAVIKGYEKDVGTKTSFYGFTAYSLVSSLQNLG 

HKGFKKIPQGKHIRYIHFLEAVRDYEWT.KAELLDKDIRKGFLNYYGRRPRERFDEDF 

TMNKYLVAHPDFLRYLKNRFLKSKNLQKPYWRLYRPTTGALLLLTALHLCDRVSAY 

GYITEGHQKYSDHYYDKEWKRLVFYVNHDFNLEKQVWKRLHDENIMKLYQRS 

FIG. 38B 

Mouse ST6GalNAcI protein beginning at residue 32 of the native mouse protein 

DPRAKDSRCQFIWKM)ASAQENQQKAEPQWIMTLSPRVHNKESTSVSSKDLKKQER 

EAVQGEQAEGKEKRKLETIRPAPENPQSBCAEPAAKTPVSEHLDKLPRTPGALSTRKTP 

MATGAWAKKKVVQATKSPASSPHPTTRRRQRLKASEFKSEPRWDFEEEYSLDMSSL 

QTNCSASVTOKASKSPWEQNIFLPMTLFLDSGRFTQSEWNRLEHFAPPFGFMELNQSL 

VQKWTRFPPVRQQQLLLASLPTGYSKCITCAWGNGGILNDSRVGREIDSHDYVFR 

LSGAVIKGYEQDVGTRTSFYGFTAFSLTQSILILGRRGFQHVPLGKDVRYLHFLEGTR 

NYEWLEAMFLNQTLAKTHLSWFRHRPQEAFRNALDLDRYLLLHPDFLRYMKJSIRFL 

RSKTLDTAHWRIYRPTTGALLLLTALHLCDKVSAYGFITEGHQRFSDHYYDTSWKRL 

IFYINHDFRLERMWKRLHDEGirWLYQRPQSDKAKN 

FIG. 38C 
MASKSWLNFLTFLCGSAIGFLLCSQLFSILLGEKVDTQPNVLHNDPHARHSDDNGQN

HLEGOMNFN AD SSQHKDENTDIAENLYQKVRILCWVMTGPQNLEKKAKHVKATW 

AORCNKVLFMSSEENKDFPAVGLKTKEGRDQLYWKTIKAFQYVHEHYLEDADWFL 

KADDDTYVILDNLRWLLSKYDPEEPIYFGRRFKPYVKQGYMSGGAGYVLSKEALKR 

FVDAFKTDKCTHSSSIEDLALGRCMEIMNVEAGDSRDTIGKETFHPFVPEHHLIKGYL 

PRTFWYWNYNYYPPVEGPGCCSDLAVSFHYA03STTMYELEYLVYHLRPYGYLYRY 

QPTLPERILKEISQANKNEDTKVKLGNP 



FIG. 41 
Yersinia MBP

MKIEEGKLVI WINGDKGYNG LAEVGKKFEK DTGIKVTIEH PDKLEEKFPQ
VAATGDGPDI IFWAHDRFGG YAQSGLLAEL TPSKAFQEKL FPFTWDAVRF
NGKLIGYPVA VEALSLIYNK DLVKEAPKTW EEIPALDKTL RANGKSAIMW
NLQEPYFTWP VIAADGGYAF KFENGVYDAK NVGVNNAGAQ AGLQFIVDLV
KNKHINADTD YSIAEAAFNK GETAMTINGP WAWSNIDKSK INYGVTLLPT
FHGQPSKPFV GVLTAGINAA SPNKELATEF LENYLITDQG LAEVNKDKPL
GAVALKSFQE QLAKDPRIAA TMDNATNGEI MPNIPQMAAF WYATRSAVLN
AITGRQTVEA ALNDAATRIT K

FIG. 43A

E. coli MBP

MKIEEGKLVI WINGDKGYNG LAEVGKKFEK DTGIKVTVEH PDKLEEKFPQ
VAATGDGPDI IFWAHDRFGG YAQSGLLAEI TPDKAFQDKL YPFTWDAVRY
NGKLIAYPIA VEALSLIYNK DLLPNPPKTW EEIPALDKEL KAKGKSALMF
NLQEPYFTWP LIAADGGYAF KYENGKYDIK DVGVDNAGAK AGLTFLVDLI
KNKHMNADTD YSIAEAAFNK GETAMTINGP WAWSNIDTSK VNYGVTVLPT
FKGQPSKPFV GVLSAGINAA SPNKELAKEF LENYLLTDEG LEAVNKDKPL
GAVALKSYEE ELAKDPRIAA TMENAQKGEI MPNIPQMSAF WYAVRTAVIN
AASGRQTVDE ALKDAQTNS

FIG. 43B

Pyrococcus furiosus MBP 

MKIEEGKWI WHAMQPNELE VFQSLAEEYM ALSPEVEIVF EQKPNLEDAL 
KAAIPTGQGP DLFIWAHDWI GKFAEAGLLE PIDEYVTEDL LNEFAPMAQD 
AMQYKGHYYA LPFAAETVAI IYNKEMVSEP PKTFDEMKAI MEKYYDPANE 
KYGIAWPINA YFISAIAQAF GGYYFDDKTE QPGLDKPETI EGFKFFFTEI 
WPYMAPTGDY NTQQSIFLEG RAPMMVNGPW SINDVKKAGI NFGWPLPPI 
IKDGKEYWPR PYGGVKLIYF AAGIKNKDAA WKFAKWLTTS EESIKTLALE 
LGYIPVLTKV LDDPEIKNDP VIYGFGQAVQ HAYLMPKSPK MSAVWGGVDG
AINEILQDPQ NADIEGILKK YQQEILNNMQ G 



FIG. 43C

Thermococcus litoralis MBP

MKIEEGKIVF AVGGAPNEIE YWKGVIAEFE KKYPGVTVEL KRQATDTEQR
RLDLVNALRG KSSDPDVFLM DVAWLGQFIA SGWLEPLDDY VQKDNYDLSV 
FFQSVINLAD KQGGKLYALP VYIDAGLLYY RKDLLEKYGY SKPPETWQEL 
VEMAQKIQSG ERETNPNFWG FVWQGKQYEG LVCDFVEYVY SNGGSLGEFK 
DGKWVPTLNK PENVEALQFM VDLIHKYKIS PPNTYTEMTE EPVRLMFQQG 
NAAFERNWPY AWGLHNADDS PVKGKVGVAP LPHFPGHKSA ATLGGWHIGI 
SKYSDNKALA WEFVKFVESY SVQKGFAMNL GWNPGRVDVY DDPAWSKSP 
HLKELRAVFE NAVPRPIVPY YPQLSEIIQK YVNSALAGKI SPQEALDKAQ 
KEAEELVKQY SK 

FIG. 43D 

Thermotoga maritima MBP

MKIEQTKLTI WSSEKQVDIL QKLGEEFKAK YGIPVEVQYV DFGSIKSKFL 
TAAPQGQGAD IIVGAHDWVG ELAVNGLIEP IPNFSDLKNF YDTALKAFSY 
GGKLYGVPYA MEAVALIYNK DYVDSVPKTM DELIEKAKQI DEEYGGEVRG 
FIYDVANFYF SAPFILGYGG YVFKETPQGL DVTDIGLANE GAVKGAKLIK 
RMIDEGVLTP GDNYGTMDSM FKEGLAAMII NGLWAIKSYK DAGINYGVAP
IPELEPGVPA KPFVGVQGFM INAKSPNKVI AMEFLTNFIA RKETMYKIYL 
ADPRLPARKD VLELVKDNPD WAFTQSASM GTPMPNVPEM APVWSAMGDA 
LSIIINGQAS VEDALKEAVD KIKAQIEK 

FIG. 43E 
Vibrio cholerae MBP

MKIEEGQLTI WINGDKGYNG LAEVGKKFEA DTGIKVTVAH PDALQDKFPQ 
TAATGDGPDI VFWAHDRFGG YAEAGLLVEI KPSAKIQEGI VDFAWDAVKY 
NGKIIGYPIA VESLSLIYNK DLVPNPPKSW EEVAELDAKL KKEGKSAIMW 
NLKEPYFTWP LMAADGGYAF KYGVDGYDVK DAGINNKGVK DAMNFVKGLV 
DKGVISPDMD YSVSESAFNQ GNTAMTINGP WSWGNIEKSG INYGVTTLPK 
FNGQASKPFV GVLTAGISTA SPNKDLAVEF IENYLLTNDG LRMVNNDKPL 
GAVALNSFQR ELDADARIAA TMDNAMNGEI MPNIPQMNAF WSSAKNAIIN
IVDGRQTVDA ALADAEKQMT KP 



FIG. 43F 